SOSD-Net: Joint semantic object segmentation and depth estimation from monocular images

نویسندگان

چکیده

Depth estimation and semantic segmentation play essential roles in scene understanding. The state-of-the-art methods employ multi-task learning to simultaneously learn models for these two tasks at the pixel-wise level. They usually focus on sharing common features or stitching feature maps from corresponding branches. However, lack in-depth consideration correlation of geometric cues parsing. In this paper, we first introduce concept objectness exploit relationship through an analysis imaging process, then propose a Semantic Object Segmentation Estimation Network (SOSD-Net) based assumption. To best our knowledge, SOSD-Net is network that exploits geometry constraint simultaneous monocular depth segmentation. addition, considering mutual implicit between tasks, iterative idea expectation–maximization algorithm train proposed more effectively. Extensive experimental results Cityscapes NYU v2 dataset are presented demonstrate superior performance approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian depth estimation from monocular natural images.

Estimating an accurate and naturalistic dense depth map from a single monocular photographic image is a difficult problem. Nevertheless, human observers have little difficulty understanding the depth structure implied by photographs. Two-dimensional (2D) images of the real-world environment contain significant statistical information regarding the three-dimensional (3D) structure of the world t...

متن کامل

Joint Semantic Segmentation and 3D Reconstruction from Monocular Video

We present an approach for joint inference of 3D scene structure and semantic labeling for monocular video. Starting with monocular image stream, our framework produces a 3D volumetric semantic + occupancy map, which is much more useful than a series of 2D semantic label images or a sparse point cloud produced by traditional semantic segmentation and Structure from Motion(SfM) pipelines respect...

متن کامل

Joint Depth Estimation and Segmentation from Multi-view Images using the Expectation-Minimization Algorithm

An algorithm for joint depth estimation and segmentation from multi-view images is presented. The distribution of the luminance of each image pixel is modelled as a random variable, which is approximated by a “mixture of Gaussians model”. After recovering 3-D motion, a reference image is segmented into a fixed number of regions, each characterized by a distinct affine depth model with three par...

متن کامل

Monocular Depth Estimation: Applications to Image Segmentation and Filtering

This Ph.D. dissertation addresses the problem of estimating depth ordering information from single images, a key issue in image understanding that in recent years has focused the interest of the community. Motivation behind this tendency is provided by several important applications that could beneficiate of advances in the field such as automatic object removal, image indexing, 3D scene recons...

متن کامل

J-MOD$^{2}$: Joint Monocular Obstacle Detection and Depth Estimation

In this work, we give a new twist to monocular obstacle detection. Most of the existing approaches either rely on Visual SLAM systems or on depth estimation models to build 3D maps and detect obstacles. Despite their success, these methods are not specifically devised for monocular obstacle detection. In particular, they are not robust to appearance and camera intrinsics changes or texture-less...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2021

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2021.01.126